A Practical Partial Parser for Biomedical Literature Summarization
Abstract
We present a partial parser called TeLePaPa (TextLens Partial Parser) that identifies subjects and predicate verbs (SPVs) in the sentences of MEDLINE citation abstracts. TeLePaPa achieves a precision of 96.7% and 97.1% for subject and predicate verb detection, respectively, and a recall of 91.3% and 94.9%, respectively. We found that the distribution of SPV pairs is similar across different research topics in the domain. In addition, we found that a power law holds between the number of citations covered by an SPV pair and the pair's rank: only half of the pairs cover about 90% of all the citations. This makes it possible to scan the huge amount of biomedical literature efficiently.
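The coverage claim in the abstract can be illustrated with a minimal sketch. It assumes each citation has already been reduced to a set of (subject, predicate verb) pairs. The function name `coverage_by_rank`, the input layout, and the toy data are hypothetical choices made for illustration and are not taken from the paper.

```python
from collections import defaultdict


def coverage_by_rank(citation_pairs):
    """Rank SPV pairs by how many citations contain them.

    citation_pairs: dict mapping a citation id to a set of
    (subject, predicate_verb) tuples (an assumed input format).
    Returns a list of (pair, citation_count, cumulative_coverage) rows,
    most frequent pair first.
    """
    total = len(citation_pairs)
    if total == 0:
        return []

    # Invert the mapping: which citations does each pair occur in?
    pair_to_citations = defaultdict(set)
    for citation_id, pairs in citation_pairs.items():
        for pair in pairs:
            pair_to_citations[pair].add(citation_id)

    # Sort by descending citation count, breaking ties alphabetically.
    ranked = sorted(pair_to_citations.items(),
                    key=lambda item: (-len(item[1]), item[0]))

    covered = set()
    rows = []
    for pair, citation_ids in ranked:
        covered |= citation_ids  # union, so shared citations count once
        rows.append((pair, len(citation_ids), len(covered) / total))
    return rows


# Toy data, invented purely to exercise the function.
toy = {
    "c1": {("we", "report")},
    "c2": {("we", "report"), ("results", "suggest")},
    "c3": {("we", "report")},
    "c4": {("results", "suggest")},
    "c5": {("study", "examine")},
}

for pair, count, cumulative in coverage_by_rank(toy):
    print(pair, count, f"{cumulative:.0%}")
```

Plotting the citation count against the rank on log-log axes is one common way to check whether a power law is plausible for such data. The sketch only computes the ranked counts and the cumulative coverage.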
Related papers
APOLN: A Partial Parser Of Unrestricted Text
In this paper, we present APOLN (Analizador Parcial de Oraciones en Lenguaje Natural): a partial parser of unrestricted natural language sentences based on finite-state techniques. Partial parsing has been used in several applications: syntactic parsing of unrestricted texts, data extraction systems, machine translation, solving the attachment ambiguity, speech recognition systems, text summari...
Systematic literature review of fuzzy logic based text summarization
"Information overload" is not a new term, but the massive development in technology, which enables anytime, anywhere, easy and unlimited access to, participation in, and publishing of information, has escalated its impact. Assisting users' informational searches with reduced reading and surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...
Abstraction Summarization For Managing The Biomedical Research Literature
Marcelo Fiszman, Thomas C. Rindflesch, Halil Kilicoglu. Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, MD 20894. {fiszman|tcr|halil}@nlm.nih.gov
Resolving ambiguity in biomedical text to improve summarization
Access to the vast body of research literature that is now available on biomedicine and related fields can be improved with automatic summarization. This paper describes a summarization system for the biomedical domain that represents documents as graphs formed from concepts and relations in the UMLS Metathesaurus. This system has to deal with the ambiguities that occur in biomedical documents....
Citation Handling: Processing Citation Texts in Scientific Documents
Thesis by Michael Alan Whidby, Master of Science, 2012. Directed by Professor Bonnie Dorr and Dr. David Zajic, Department of Computer Science. Citation sentences (sentences that cite other papers) play a key role in the summarization of scientific articles. However, a citation-based summarization system that depends on ...